Goto

Collaborating Authors

 time faster performance


AWS Launches Graviton3 Processors For Machine Learning Workloads

#artificialintelligence

Amazon Web Services (AWS) announced the launch of the third generation of its AWS Graviton chip-powered instances, the AWS Graviton3, will power all-new Amazon Elastic Compute 2 (EC2) C7g instances, which are currently available in preview, three years after the original version of the processors was released. According to AWS, the new Graviton3-powered instances will give up to 25% faster compute performance and 2x more excellent floating-point performance than the current generation of AWS EC2 C6g Graviton2-powered instances be unveiled at the AWS re:Invent 2021 conference in Las Vegas. According to AWS Graviton2 instances, the new Graviton3 instances are up to 2x quicker when performing cryptographic workloads compared to the business. According to AWS, the new Graviton3-powered instances will give up to 3x more excellent performance for machine learning workloads than Graviton2-powered instances, including support for bfloat16. The AWS Graviton chips are Arm-based 7nm processors custom-built for cloud workloads by Annapurna Labs, an Israeli engineering startup AWS bought roughly six years ago.


Amazon announces Graviton3 processors for AI inferencing

#artificialintelligence

At its re:Invent 2021 conference today, Amazon announced Graviton3, the next generation of its custom ARM-based chip for AI inferencing applications. Soon to be available in Amazon Web Services (AWS) C7g instances, the company says that the processors are optimized for workloads including high-performance compute, batch processing, media encoding, scientific modeling, ad serving, and distributed analytics. Alongside Graviton3, Amazon unveiled Trn1, a new instance for training deep learning models in the cloud -- including models for apps like image recognition, natural language processing, fraud detection, and forecasting. It's powered by Trainium, an Amazon-designed chip which the company last year claimed would offer the most teraflops of any machine learning instance in the cloud. As companies face pandemic headwinds including worker shortages and supply chain disruptions, they're increasingly turning to AI for efficiency gains.